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PREFACE 


The Agriculture and Resources Inventory Surveys Through Aerospace Remote 
Sensing is a multiyear program of research, development, evaluation, and 
application of aerospace remote sensing for agricultural resources, which 
began in fiscal year 1980. This program is a cooperative effort of the 
U.S. Department of Agriculture, the National Aeronautics and Space 
Administration, the National Oceanic and Atmospheric Administration (U.S. 
Department of Commerce), the Agency for International Development (U.S. 
Department of State), and the U.S. Department of the Interior. 
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A comparison of selected characteristics of 
Landsat's mul tispectral scanner (MSS) and 
Metsat's advanced very high resolution 
radiometer (AVHRR) in the local area coverage 
(LAC) mode. 

Cloud density codes and their derivations. 

The cloud code asigned to an 10 grid cell was 
dependent on the K/I ratio; i.e. the ratio of 
the remaining pixels after applying a threshold 
to the total number of pixels in the cell. 

This table is a sunmary of descriptive statistics 
for the three types of unscreened (a) and screened 
(b) EVI VIN's for the July observations, over 
II 1 inois. 
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FIGURES 


Figure , Page 

1-1 This plot shows the highly variable scene mean 1-4 

EVI's, uncorrected, over the Illinois study site 
in July, 1980. 

1-2 The study cells. This site consists of 81 IJ grid 1-8 

cells, as shown, in the southern three quarters 
of Illinois. 

4-1 A graph of the cloud indicator index, calculated 4-3 

from channels 3 and 4 against the cloud density 
code, computed from a threshold base on channels 
1 and 2. The greater the positive cloud density 
code the greater the number of cloud contaminated 
pixels within a cell. Notice that a secondary 
ordinate placed at CII=37.0 places most of the 
cloud contaminated cells to the left (CII<37.0) 
and most of the contaminated cells to the right 
(CII>37.0). Refer to table 4»1 for an explanation 
of the cloud density code. 

4-2 (a) - a graph of midcell pixel numbers against 4-5 

EVI's calculated from the July channel means, 
n “ 648 (b) is the same July EVI against midcell 
pixel numbers with a Cl I threshold of < 37.0 and 
a midcell pixel number threshold of <200, n = 312. 

Notice the reduction of variability of the EVI 
graph (b) compared to graph (a). 

4-3 Plots of the July Channel 1 cell means against 4-7 

channel 2 cell means without (a) and with (b) 
thresholds of CII < 37.0 and midcell pixel number 
$ 200. Notice how the bulk of the tightly 
clustered data with low reflectance in channels 
1 and 2 remained after thresholds were applied 
and the less clustered more amalgamous data has 
been omitted. 

4-A' Plots of the July channel 3 cell means against 4-9 

channel 4 cell means without (a) and with (b) 
thresholds of CII< 37.0 and midcell pixel numbers 
< 200. Notice how the bulk of the tightly 
clustered data remained after the thresholds 
were applied and the amalgamous data has been 
omitted. 
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These plots show the EVI plotted against the 
midcell pixel number to illustrate the distri- 
bution changes as different solar correction/ 
standardization techniques are applied. All 
plots use the screened July data for illus- 
tration. The EVI is uncorrected and unstandard- 
ized in (a); the standard Landset MSS correction 
(1.0/cos 9) has been applied in (b); and the EVI has 
been solar corrected and standardized to 39° in (c). 
Note how the EVI becomes increasingly more bias 
as corrections and standardizations are applied. 

4-6 The above tables show the results from the tests 
of equality between variances (a) and means (b) 
at the a = 0.05 level of significance for un- 
screened and uncorrected EVI's in July, 1980. 

Note: * - indicates means or variances are 
significantly different and the null hypothesis 
of equality should not be accepted; NS - denotes 
no significant difference and the null hypothesis 
should be accepted. 

4-7 The above tables show the results from the tests 
of equality between variances (a) and means 
(b) at the « = 0.05 level of significance for 
screened and uncorrected EVI's in July 1980. 

Note: ^ = indicates means or variances are 
significantly different and the null hypothesis 
of equality should not be accepted; NS - denotes 
no significant difference and the null hypotheses 
should be accepted. * 

4-8 A plot of screened and unscreened scene mean EVI's 
over the Illinois study site by date for the July 
data only. July 9th and 10th screened EVI's are 
not plotted as only 3 and 6 observation, respect- 
ively, were left after screening. 
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CLOUD SCREENING AND SOLAR CORRECTION INVESTIGATIONS ON THE 
INFLUENCE OF NOAA-6 ADVANCED VERY HIGH RESOLUTION RADIOMETER 
DERIVED VEGETATION ASSESSMENT 


I. INTRODUCTION 

The use of remotely sensed data to identify, mensurate and assess terres- 
trial landcover has accelerated exponentially since the end of World War 

II. Black and white and black and white infrared aerial photography 
progressed to color and color infrared photography along with improvements 
in film resolution. Aircraft which were able to obtain increasingly higher 
altitudes offered greater areal coverage of the earth's surface. Subse- 
quent terrestrial and atmospheric remote sensing instruments evolved to 
aircraft and spacecraft borne analog and digital scanners, radars and 
cameras with superior radiometric, spatial and temporal resolution. The 
Earth Resources Technology Satellite (ERTS), launched in 1972, housed a 
return beam vidicom (RBV) and a mul tispectral scanner (MSS). These instru- 
ments were intended to provide useful and timely data to analyze landcover 
throughout the globe. ERTS was later renamed Landsat as the system evolved 
in sophistication. 

The National Aeronautics and Space Administration (NASA) implemented the 
Large Area Crop Inventory Experiment (LACIE) primarily to identify and 
inventory world wheat production using Landsat MSS data. One significant 
conclusion of the experiment was that global vegetation conditions could be 
monitored with MSS data (NASA, 1978). Following LACIE the Agriculture and 
Resources Inventory Surveys Through Aerospace Remote Sensing (AgRISTARS) 
project was implemented to research, develop, evaluate, and apply aerospace 
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remote sensing for agricultural resources. Through the AgRISTARS program, 
with its multi-agency participation, the National Oceanic and Atmospheric 
Administration (NOAA) meteorological satellites (Metsat) were found to be 
useful for determining environmental conditions (Gray and McCrary, 1981). 
The first Metsat was the Television Infrared Observation Satellite (TIROS) 
which was launched in 1960. Additional instrumentation and new techniques 
have been incorporated in later Metsats which not only provided increased 
weather information but also provided excellent data for other disciplines 
as well (Horvath, et al , 1982). The most current Metsats, NOAA-6 and 
NOAA-7, have onboard four sensor systems; the Advanced Very High Resolution 
Radiometer (AVHRR), the TIROS Operational Vertical Sounder (TOVS), the Data 
Collection System (DCS), and the Space Environmental Monitor (SEM). The 
primary sensor, the AVHRR, operates in a global area coverage (GAC) mode or 
a local area coverage (LAC) mode and can provide real-time transmission or 
digital tape recordings for playback at a later time (Gray, et al , 1981). 

NOAA-6 and NOAA-7 AVHRR sensors provide daily (occasionally twice daily) 
coverage for any given point on the earth's surface at a resolution, at 
nadir, of one kilometer for LAC and four kilometers for GAC. Landsat's MSS 
sensor provides 18 day (9 day when two satellites are in operation) 
coverage for any given location at a resolution of 79 meters (.079 km). 

This contrast in sensor coverages and resolution provides exceptional 
temporal and spatial data for environmental analysis. 

The Early Warning and Crop Condition Assessment Project of the AgRISTARS 
program is using the temporal/resol ution contrast between Metsat and 
Landsat in research to develop and test techniques which will make possible 
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or enhance operational methodologies for crop condition assessment. The 
results of this research will be used primarily by the Foreign Agriculture 
Service (FAS) of the USDA which is responsible for providing early warning 
of changes which may affect foreign crop production, its quality, and for 
assessing crop conditions (NASA, 1982). 

This paper presents results obtained under task fourteen; effects of 
spatial resolution on information content, which is under the Spectral 
Analysis and Assessment project of the 1982-83 Project Implementation Plan 
for the Early Warning and Crop Condition Assessment Project of AgRISTARS 
(Boatwright, 1981). Specifically this paper shows how adverse effects on 
vegetative index numbers derived from Metsat AVHRR data can be reduced. 

1.1 Background 


Vegetation index numners derived daily from NOAA-6 exhibit a high degree of 
variability as seen in Figure 1-1. The accuracy of an index will always be 
questionable to some extent but an index should be precise. Figure 1-1 
shows that within-the-scene daily fluctuations in the data are not precise 
nor can they be accurate. 

There also exists the question of daily comparability. Should some method 
of relative standardization be employed to compare indexes that are tempor- 
arily separate? An index being used to monitor crop conditions through the 
year should be reliable from one observation to the next. If conditions 
are near identical between two observations the indexes should also be near 
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Figure 1-1. This plot shows the highly variable scene mean EVI's, 

uncorrected, over the Illinois study site in July, 1980. 
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Identical for that crop regardless of satellite position, sun position or 
atmospheric conditions. 


Horvath, et al , (1982) presents a comprehensive study of channels one and 
two of NOAA-6 AVHRR data. This study has indications of index instability. 
Apparent from this study, the greatest amount of instability can be attri- 
buted to clouds in the scene. 


1.2 Scope 


This .-aper documents the efforts conducted to establish a method to iden- 
tify adverse effects on AVHRR derived vegetative index numbers (VINs). 

Once these adverse effects were identified, procedures were established to 
correct or remove these adverse effects on AVHRR derived VINs. These pro- 
cedures should be easily implemented on the operational data base of the 
USDA Foreign Agriculture Service. 


1 . 3 Assumptions and Constrair.'^s 


Procedures developed during this study are intended to be implemented on 
the USDA Foreign Agriculture Service's (FAS) operational data base 
(CADRE)*, hence, the major constraint is working within the IJ grid system 
that CADRE is spatially divided into. Over the study area these IJ grid 
cells are approximately 25 nautical miles square. All vegetation index 
numbers, model estimates, crop condition assessments or soil moisture 
conditions are all reduced to these 625 square natuical mile IJ grid cells. 
Because of the enormous size of the database spatial elements, one 
assumption made was any given IJ grid cell cannot significantly change in 
one day. Perhaps during the course of a week an area the size of an IJ 
grid cell can change. One exception to this assumption would be any 
adverse event. These events include rainstorms, snowstorms, high winds, 
etc. 


Another assumption that must be made is that the AVHRR sensor provides 
stable radiometric data. With these assumptions in mind, any adverse 
effect on AVHRR data and subsequent indexes are due to atmospheric 
conditions, sensor scan angles, and/or solar zenith angles. 


* CADRE - Crop Asssessment Data Retrieval and Evaluation Database 
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1.4 Study Area 


The study area lies nearly completely in the southern three-quarters of the 
state of Illinois. The study area consists of 81 IJ grid cells arranged in 
a seven by eleven pattern with two additional cells on the north side of 
the northwest corner of the seven by eleven grid area and two additional 
cells on the east side of the northeast corner as shown in Figure 1-2. The 
study area also contains small portions of Missouri, Kentucky, Indiana and 
Iowa. 

The natural vegetation of the area consists primarily of Oak-Hickory 
forests and B1 uestem prarie with smaller amounts of Oak-Hickory-Pine forest 
and Southern floodplain forests in the southern portion of the study site. 
Coirsnercial agricultural use includes cropland in the central and north- 
eastern areas and cropland with pasture and forests in the southern and 
western portion of the study area. Specifically, corn is the dominant crop 
type grown north of an east-west line through the St. Louis vicinity and 
wheat is primarily grown south of the line (Horvath, et al , 1982). 

Major physiographic features consist of smooth plains in the central and 
northeastern portions where the majority of the corn crop is grown. The 
southern and western areas consist of irregular plains where the primary 
wheat production is located. Topographic relief ranges from 500 to 1000 
feet above sea level. 
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Figure 1-2. The study cells. This site consists of 81 IJ grid cells, as 
shown, in the southern three-quarters of Illinois. 
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2.0 METSAT AND METSAT DERIVED VEGETAL INDEX NUMBERS 


Indexes to indicate standing green biomass quantity and/or quality were 
first determined (and still are) from remote platforms with aerial photo- 
graphy. However, vegetation index numbers (VINs) are usually associated 
with Landsat MSS data. Recently though, VINs have been derived from Metsat 
data (Gray and McCrary, 1981). This section discusses some of the attri- 
butes of Metsat and the VINs derived from Metsat AVHRR data. 

2.1 POES Operational Data Flow 

Polar orbiting environmental satellites* (POES) have been operational since 
October 1978. The POES system is operationally designed to monitor global 
atmospheric phenomena. There are contrasting as well as complementary 
attributes of both the Landsat MSS and Metsat AVHRR (See table 2-1). Land- 
sat MSS data is reflective in all four channels while Metsat AVHRR is re- 
flective in two channels, emmitive in one channel and both reflective and 
emmitive in one channel. (Note; there is no sensible reflected energy 
during a nighttime pass). Because of the success of establishing VIN's with 
Landsat MSS data (reflective) the Early Warning and Crop Condition Assess- 
ment project of AgRISTARS has explored the utility of using the reflective 
channels of NOAA-6 and NOAA-7 to derive VIN's with greater frequency and at 
a coarser resolution. 


*P0ES is a term to designate a family of satellites based on TIROS-N. This 
family includes TIROS-N, NOAA-6 and NOAA-7 meteorological satellites. 

Met sat s. 
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Table 2-1 - A comparison of selected characteristics of Landsat's multi- 
spectral scanner (MSS) to Metsat's Advanced Very High 
Resolution Radiometer (AVHRR) in the local area coverage (LAC) 
mode. 


CHARACTERISTIC 

LAMDSAT (MSS) 

METSAT (AVHRR) 

Spectral Band Response 

1: 0.50 - 0.60 

1: 0.55 - 0.70 

{n m) 

2: 0.60 - 0.70 

2: 0.70 - 1.10 


3: 0.70 - 0.80 

3: 3.55 - 4.00 


4: 0.80 - 1.10 

4: 10.50 - 11.50 

Ground Resolution 

79 M 

1100 M 

(IFOV) 



Scan Width 

185 kM 

2900 kM 

(FOV) 

(+5.78°) 

(+55.4°) 

Temporal Coverage 

18 day 

1 day 

Altitude 

920 kM 

832 kM 
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The following steps are taken to make NOAA-6 AVHRR LAC data available to be 
processed at the Early Warning and Crop Condition Assessment project office 
in Houston, Texas. 


f The AVHRR sensor onboard NOAA-6 receives spectral data and records or 
transmits digital counts. 

9 Data counts are received and recorded on high density digital tapes 

(HDDT) at Wallops Island, Virginia; Gilmore Creek, Alaska; San Francisco, 
California or Honolulu, Hawaii. 

9 The data on HDDT's are then sent by microwave transmission to NOAA's 
National Meteorological Center in Suitland, Maryland where the HDDT's are 
processed into LAC and GAC tapes by the National Environmental Satellite 
and Data Information Service (NESDIS) of NOAA which is responsible for 
archiving and distributing LAC and GAC digital tapes. 

e Digital LAC tapes are received by the Early Warning Crop Condition 

Assessment project office in Houston, Texas where they are displayed and 
reviewed for quality and areas of interest. 

• Acceptable tapes or portions of tapes are processed onto the USDA Foreign 
Agriculture Service's PDF 11/70 disk files which are divided into IJ grid 
cells* of interest. 

• These disk files are further processed to compute VIN's, etc., based on 
user requirements. 


The above steps are completed within five to ten days thus facilitating near 
real time processing. 


*IJ grid cells are derived from dividing a polar stereographic projection of 
each hemisphere into 512 by 512 cells. Dimensions of the cells range from 
29 miles square near the poles to 14 miles square near the equator (Charney 
and Phillips, 1953). 
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2.2 Metsat Derived Indexes 


In either an operational or research mode there are three common indexes 
calculated using four channel AVHRR data. The following briefly discusses 
the theory behind each index and illustrates its calculation. 

2.2.1 Environmental vegetation index 

The environmental vegetation index* (EVI) is the difference between the 
infrared reflectance (Channel 2) and the visible reflectance (Channel 1). A 
complete discussion is presented by Horvath, et al , (1982). Mathematically: 

EVI = R 2 - Ri 

where Rg and R^^ represent the reflectance derived from channel two and 

channel one, respectively. There are also two commonly used derivations of 

the EVI. One derivation of the EVI is the solar corrected EVI (EVI ). 

corr' 

The solar correction, one over the cosine of the solar zenith angle, is a 
commonly applied solar correction used with Landsat and is calculated: 

^^^corr = ^*^2 “ *^1^ * 9) 


* Also knov/n as GMI (6ray-McCrary Index) 
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where 0 Is the solar zenith angle. The other derivation of the EVI is the 
standardized/solar corrected EVI (EVI*) where the solar corrected EVI is 
standardized to a common look angle, usually 39 degrees, that of Landsat. 
The EVI* is calculated: 

EVI* = (Rg - Rj)*(1.0/C0S 0)*(COS 0/COS 9) 


where 0 is the standard angle. 

2.2.2 Normalized vegetation index 

The Normalized Vegetation Index (NVI) was an attempt to reduce between scene 
variability in VIN's. The principle of this VIN calculation is based on the 
idea of mathematical standardization which is to substitute this normalized 
ratio for reflectance (Cate, et al , 1981). The NVI is calculated: 

NVI = (Rg - Rj)/(R 2 + Ri) 

where R 2 and Rj is the reflectance derived from channel two and channel one, 
respectively. 
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2.2.3 Cloud indicator index 


The cloud indicator index (CII)* is calculated from the thermal channels. 
Channel four is a direct measure of radiant temperature of the emitting 
target. Channel three is composed of both reflective and emittive com- 
ponents. Also, the channel three band width covers a water absorption 
windov/. Complete exhaustive analysis of channels three and four is 
currently underway and is beyond the scope of this project. However, the 
CII is believed to be useful in indentification of clouds. It is assumed 
the Channel three transformation is a general measure of wetness and channel 
four is a general measure of temperature. Therefore, cold, wet objects 
(clouds) are identifiable using this index. The CII is the difference 
between channel four and channel three: 

CII = E 4 - (100 * E 3 ) 


where E^ and E 3 are the energy in milliwatts per square meter per steradian 
per centimeter as derived from channel four and channel three counts 
respectively. 


* Also known as the McCrary-Gray Index (MGI) 
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3.0 ANALYTIC PROCEDURES 


The data acquired for this study is the complete set of data originally 
used by Horvath, et al , (1982) to investigate the reflective channels of 
Metsat's AVHRR. Horvath, et al , indicated this representative data set 
possessed some instability diurnally as well as daily. This study was also 
in support of a study to determine the information content using variable 
pixel (cell) sizes. A detailed statistical investigation is beyond the 
scope of this study because during the preliminary analysis stage, and 
since, the NOAA-6 AVHRR sensor has failed to provide significant data. The 
NOAA-7 AVHRR sensor has completely replaced, rather than supplement, NOAA-6 
AVHRR data and the satellite attributes are of such a difference only a 
cursive investigation is within the scope in order to identify “universal" 
situations or conditions which may be transferable to NOAA-7 and NOAA-8 
technology. 

3. 1 Preliminary Investigations 

The dataset used for a cursory examination covered the dates July 9, 1980 

to July 15, 1980, with two observations on July 13, 1980; October 8, 1980 

and October 10, 1980 through October 12, 1980. This dataset consisted of 

the EVI*, I and J grid numbers, the date, solar zenith angle and pixel 

numbers for the cell center and I-and K-counts. I-counts are a count of 
all the pixels in a given IJ grid cell. K-counts are a count of the pixels 
used in the calculation of the EVI* for a grid cell. The difference 
between I-counts and K-counts is the number of pixels omitted by the 
thresholding technique used. Later investigations were unable to identify 
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the thresholding technique used. However, this thresholding technique was 
originally intended to identify pixels containing clouds and visually 
appeared to do so. 

Classical statistical t and F tests were performed on the scene means and 
variances, using this dataset, testing to find which, if any, adjacent 
dates have the same statistical mean and/or variance. Considering only the 
eight observations in July, 1980 and assuming the scene cannot significant- 
ly change from day to day only three variances and none of the means of the 
scene EVI*'s were statistically the same. 

From a visual interpretation of the histograms of EVI*'s it is evident that 
essentially cloud free days (July 14, July 15) present a near normal 
distribution. However, dates containing substantial amounts of clouds 
(July 9 and July 10) exhibit a definite bi-model distribution of EVI*'s. 

From scatterplots of nearly every variable combination in this small data- 
set evidence was found which indicated thresholds exist to identify clouds. 
Also, there seemed to be unexplained trends in the scatterplots. 


The above explained relationships provided enough evidence to warrant 
development of a complete computer compatible data base for further 
analysis. 
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3.2 Data Base Develoiiment 


Due to the inability to identify the thresholds applied to Channel one, 
Channel two or the subsequent EVI* used in the cursorv examination, a new 
data base was developed over the same area during the same dates which 
contained data with no thresholds applied. The data base consists of: 

• the Julian date, 

• I and J cell numbers, 

• the mean for each of the four channels per IJ grid cell with Channels 
one and two in percent reflectance and channels three and four in 
degrees Kelvin, 

• the solar zenith angle at the IJ cell center, 

V the pixel number at the IJ cell center, 

• the K/I ratios, 

• the minimum and maximum daily temperatures, and 
9 the daily precipitation. 

This data base lends itself well to the calculation of EVI, EVI*, 

NVI and Cl I with or without thresholding or correcting of the values. 
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3.3 Statistical and Graphical Analysis 


The approach to the analysis of this data has been kept reasonably simple 
for two reasons. First, the NOAA-6 AVHRR sensor has not provided useful 
data since the summer of 1982. Secondly, significant findings from this 
analysis intend to be easily implemented on an operational global vegeta- 
tion analysis system by the USDA Foreign Agricultural Service. To keep 
within the intended scope of this study, a simple examination of the 
spectral profile should suffice. Assessment of data information content 
was based on such an examination. 

Student t statistics were computed to test equality of the scene means 
between all possible dates, with particular attention given to consecutive 
acquisition dates. Null hypotheses of equal means betv/een all possible 
dates with an acceptance level of five percent (two tail) was used. Also, 

F statistics were computed to test variance equally for all possible dates. 
Null hypotheses of equal variances between all possible dates with an 
acceptance level of five percent (two tail) was used. It was necessary to 
complete variance equality tests first in order to select the proper 
equation to use when testing the means. The tests of means and variances 
were computed on the data set as it was and also after any application of 
thresholding techniques. This gave an indication of how the sample 
distribution behaved before and after thresholding procedures were applied. 


Scatterplots were generated for many combinations of variables to identify 
relationships that could be used to identify and eliminate adverse effects 
on the AVHRR derived VIN's. Scatterplots were regenerated using the 
thresholding techniques to verify the utility of the threshold. 

Scatterplots were also generated to determine what effect, if any, solar 
CO. f ret ions had on the information content of the data or the corrected 
VINs. 
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4.0 RESULTS AND CONCLUSIONS 


This section discusses the significant findings related to information 
content of NOAA-6 AVHRR data. Methods to screen data for cloud 
contamination based on the spectral profile were established. Effects of 
solar correcting algorithms were determined. Also shown in this section 
are the results of the statistical tests describing the population before 
and after screening techniques were applied. 

4.1 Identification of Cloud Contaminated Cells 


The database contained two complementary variables, namely the cloud 
density code (CDC) and the cloud indicator index (CII). The cloud density 
code was derived from a previous iteration of the database from K/I ratios. 
The K/I ratio was the ratio of the number of pixels wt^neh passed some 
undefined threshold to the total number of pixels in an IJ grid cell. This 
ratio was then coded using the coding scheme shown in table 4.1. The 
threshold was used by the USDA Foreign Agricultural Service to identify 
clouds and other amalgomous non-vegetal features based on the EVI of each 
pixel. The CII, however, is intended to identify clouds based on the 
thermal channel means of each IJ cell. Figure 4.1 shows a plot of the 
cloud density code versus the cloud indicator index. Figure 4.1 also shows 
that a CII in the range from 30 to 40 will put the majority of the cloud 
contaminated cell numbers at less than 30 to 40 while uncontaminated cells 
would have index numbers greater than 30 to 40. A CII of 37.0 (second 
ordinate in figure 4.1) gave the best visual separation between the cloud 
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Table 4.1 - Cloud codes and their derivations. The cloud code assigned to 
an IJ grid cell was dependent on the K/I ratio; i.e., the ratio 
of the remaining pixels after applying a threshold to the total 
number of pixels in the cell. 


CLOUD 

CODE 

K-I RATIO RANGE 
(percent) 

0* 

.9998 - 1.000 no clouds 

1 

.9875 - .9997 

2 

.8125 - .9374 

3 

.6875 - .8124 

4 

.5625 - .6874 

5 

.4375 - .5624 

6 

.3125 - .4374 

7 

.1875 - .3124 

8 

.0625 - .1874 

9 

.0000 - .0624 completely cloud covered 


* a cell which had been assigned a code of zero was changed to -2 if any of 
the neighboring cells had a cloud code equal to 1, if any neighboring 
cells had C'^des greater than 1 the cell was assigned the value -1. The 
code of zero remained if none of the neighboring cells contained cloud 
codes. 
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contaminated cells and the uncontaminated cells. This produced approxi- 
mately 85 percent agreement between the CDC and the Cl I for both the July 
and October portions of this database; therefore, a threshold CII of 37.0 
was used to identify cloud contaminated cells in the database. It is 
beyond the scope of this study to fine tune this threshold. Increasing the 
CII threshold will eliminate more cloud contaminated cells, but will also 
reduce, to a greater extent, the number of uncontaminated cells. Lowering 
the CII thi^eshold will increase the number of uncontaminated cells and also 
increase the number of contaminated cells. Wiien the EVI was plotted 
against the midcell pixel number associated with it to observe its distri- 
bution, both with and without the CII threshold of 37.0 being applied, it 
was noteo that EVI's with associated midcell pixel numbers less than 200 
failed to adhere to the conformity of the EVI's with midcell pixel numbers 
greater than 200. Therefore, an ancillary threshold on the midcell pixel 
number of 200 was incorporated*. The effects of the two screening methods 
are illustrated in Figure 4.2a and Figure 4.2b. Figure 4.2a is a display 
of the EVI calculated from the July channel means against the midcell pixel 
number without any thresholds applied. The lack of EVI values less than 
two between midcell pixel numbers 500 and 1000 is due to the fact that the 
acquisitions collected when the study site was in the 500 to 1000 pixel 
number range were cloud free. Figure 4.2b displays the EVI's calculated 


*This scan line cut off, plus more drastic scan line cut backs are 
supported by Johnson (1983) and Horvath, et al , (1982). 


4-4 



OF POOR QuAirrif 


20 sa-H 


tsaa-4 


6«Q-i 


* Z 4 . ** 

•»••*■ ^ ♦+4. 

- * w 


<s 


5$;0M *’■;♦! ^V 

“*v. 



**■ *•’* * < i%* V* *Mi»* *.** 


a. 


-i» -a -0 ~4 -2 a 2 4 8 8 

EMVIROMnENTAl. VECCTA-OOW lMDO{ <EVI5 


10 


12 


P 

X 

X 

E 

L 

N 

U 

M 

0 

E 

a 



b. 


-10 


».. ^^W ■■IW■ ^ ■ ■ ^ W ■■■■ H I H| |. 1, . ^, . MH^„^ , „„.. ., I 

- 0 - 0 - 4-002400 

raMaK689«si^A&. veaeTATxsn xisssx eevss 


w oyi0 0 0iii 0 W 0 ^ I 

to t2 



Figure 4.? - (a) - a graph of midcen pixel numbers against EVI's calcu- 
lated from the July channel means, n = 648; (b) is the same 
July EVI values against midcell pixel numbers with a Cl I 
threshold of ^ 37.0 and a midcell pixel number threshold 
of ^ 200, n = 312. Notice the reduction of variability of 
the EVI in graph (b) compared to graph (a). 
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from the July channel means after the Cl I threshold of less than or equal 
to 37.0 and the midcell pixel number threshold of less than or equal to 200 
was applied. It appears as though the Cl I threshold has the same effect as 
an EVI threshold of approximately five; however, this is not true. Grid 
cells with EVI's as high as nine were omitted due to the CII threshold. 

What appear to be gaps due to natural clustering of the EVI's along the 
scan line are due to lack of data, i.e. during these eight acquisitions the 
study site was not found along those portions of the scan line. 

It is also interesting to note the effect of screening on the reflective 
and thermal channels themselves. Figure 4.3 is a plot of the reflectance 
in channel one against the reflectance in channel two before and after 
screening. In figure 4.3a, which is a plot before screening, the bulk of 
the data is clustered below 15 percent reflectance in channel two and below 
ten percent in channel one with a large "tail" of data trailing away from 
the cluster with Increased reflectance. ' Figure 4.3b, which is a plot after 
screening techniques had been applied, illustrates how the bulk of the data 
of low reflectance remained and the tail of high reflectance data was 
omitted. Since natural terrestrial objects have relatively low reflectance 
in the reflective portion of the spectrum, this is a good indication that 
the screening techniques are identifying clouds and possibly hazy condi- 
tions which have a higher reflectance than natural terrestrial objects. 

This figure (4.3) may support an argument to screen (threshold) data based 
on channel one reflectance of approximately 9 or 10 percent. Since the 
channel two reflectance varies during a growing season according to the 
amount of green bicmiass present, a similar argument cannot be made using 
channel two. 
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Figure 4.3 - plots of the July channel 1 cell means against channel 2 cell * 
means without (a) and with (b) thresholds of Cl I < 37 and 
midcell pixel numbers ^ 200. Notice how the bulk of the 
tightly clustered data with low reflectance in Channels 1 and 
2 remained after thresholds were applied and the less 
clustered more amalgamous data has been omitted. 

♦reflectance - percent albedo. 
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Figure 4.4 is a plot similar to figure 4.3 except that the thermal channels 
are plotted against each other in energy (mWm sr cm). The results in the 
thermal channels are similar to the reflective channels in eliminating the 
amalgamous data present (Fig. 4.4a) and leaving behind the bulk of tne 
clustered data after screening (fig. 4.4b). The amalgamous data for the 
most part is colder and v^tter, which for the most part are clouds or 
atmospheric haze. 

4.2 Effects of Solar Correction/Standardization on the EVI 


The information content of the EVI can be significantly altered by applying 
solar corrections and/or standardization techniques. In figure 4.5 the 
EVI, EVI^-^„, and EVI* are calculated from the July data and are plotted 
against the midcell pixel number to illustrate the changes in the EVI and 
its derivations along a scan line. A scan line also has a constantly 
changing solar zenith angle associated with it. The solar corrected and 
standardized ,EVIs the EVI* (fig 4.5c), is extremely biased. For example, 
it would be impossible to obtain an EVI* less than 20 when the midcell 
pixel number is less than 500. Nearly any EVI value within the dynamic 
range of EVI's (fig. 4.5a) can be obtained regardless of midcell pixel 
number. The EVI*, which is supposed to be an indication of vegetal 
conditions, is more dependent on satellite positions (or its solar zenith 
angles) than it is on actual ground conditions. 
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Figure 4.4 - plots of the July channels 3 cell means against channel 4 cell 
means without (a) and with (b) thresholds of CII < 37 and mid- 
cell pixel numbers < 200. Notice how the bulk of the tightly 
clustered data remained after the thresholds were applied and 
the amalgamous data has been omitted. 
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Using simple linear regression to model EVI and EVI* against the midcell 

pixel number, a coefficient of correlation (R ) can be calculated which 

measures the amount of the error explained by the model. In the case of an 

indicator index, the model's independent variables should be independent 

(uncorrelated) of the index (dependent variables). When the midcell pixel 

number is used to model the EVI* the coefficient of correlation was 75 

percent which could be broadly interpreted as an indication that tne 

midcell pixel number is nearly as good as the EVI* at indicating vegetal 

2 

conditions. When the EVI is regressed on the midcell pixel number an R 
value of two percent was obtained indicating the EVI is independent 
(uncorrelated) to the mid cell pixel number. 

4.3 Mean and Variance Equality Tests 

Based on the assumptions described in Section 1.3, it is not likely that an 
IJ grid cell (approximately 530,000 acres) would have a significant change 
in illumination or reflectance properties between acquisitions, approxi- 
mately 24 hours. Under these assumptions the scene means and variances 
would be more or less identical from one day to the next. This is not the 
case. The biggest impact on the daily means and variances is due to clouds 
and other atmospheric interferences as shown in figure 4.6. This figure 
shows the results of tests of equality of scene means and variances of all 
July EVI values. After applying the CII threshold and midcell pixel number 
threshold to the July EVI scene means and variances, there is an 
approximately 30 percent increases in the percentage of scene means and 
variances which do not exhibit a significant difference, as shown in figure 
4-7. 
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a. Results from the test of variance equality. 
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b. Results from the test of mean equality. 


Figure 4.6 The above tables show the results from the tests of equlaity 
between variances (a) and means (b) at the a = o.05 level of 
significance for unscreened and uncorrected EVI's in July, 1980. 
Note: * - indicates means or variances are significantly different 
and the null hypothesis of equality should not be accepted; NS - 
denotes no significant difference and the null hypothesis should be 
accepted. 


4-12 






192 

193 

194 

195a' 

195b 

196 

197 

Day 

NS 

NS 

NS 


'Hi 

NS 

NS 

191* 


NS 



NS 

NS 

NS 

192* 



n 


Bi 

NS 

* 

193 





* 

* 

NS 

194 







195a 





* 

* 

195b 






* 

196 


a. Results from the test of variance equality. 


192 

193 

194 

195a 

195b 

196 

197 

Day 

NS 

NS 

Bi 


Bli 

NS 

NS 

191 


NS 

m 


NS 

NS 

★ 

192 



B 


NS 

NS 

* 

193 





NS 

* 

NS 

194 







195a 





NS 

NS 

195b 






* 

196 


b. Results from the test of mean equality. 

Figure 4.7 - The above tables show the results from the tests of equality 
between variances (a) and means (b) at the a = 0.05 level of 
significance for screened and uncorrected EVI's in July, 1980. 

Note: * - indicates means or variances are significantly different 

and the null hypothesis of equality should not be accepted; NS - 
denotes no significant difference and the null hypothesis should be 
accepted. 


1 - all IJ grid cells during this observation had pixel numbers less than 

200 . 

2 - day 191 and 192 had 3 and 6 observations respectively after screening. 
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The screening methods discussed in this section improve the reliability of 
the EVI as seen in the means in figure 4-8 and the variances in table 4-2. 
These types of improvements in information content of data are necessary in 
operational databases or in studies requiring comparisons temporally, such 
as large scale change monitoring. 
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Figure 4-8. A plot of screened and unscreened scene mean EVI's over the 

Illinois study site by date for the July data only. July 9th and 
10th screened EVI's are not plotted as only 3 and 6 observations, 
respectively, were left after screening. 
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Table 4-2. Tills table is a summary of descriptive statistics for the three 
types of unscreened (a) and screened (b) EVI VIN's for the July 
observations, over Illinois. 


(a) Unscreened 


VIN 

MIN 

MAX 

MEAN 

VAR 

N 

EVI 

-8.0 

10.3 

4.65 

21.39 

648 

corr 

. 

o 

1 

20.5 

8.10 

61.38 

636 

EVI* 

-18.8 

38.1 

10.98 

121.99 

636 


(b) Screened 


VIN 

MIN 

MAX 

MEAN 

VAR 

N 

EVI 

5.4 

10.3 

7.94 

1.04 

312 

EVI 

corr 

7.8 

19.7 

13.18 

5.36 

312 

EVI* 

8.7 

30.9 

17.20 

20.92 

312 
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